Words containing ,:

Rank in Wordlist | Frequency | Word |
---|---|---|
4289 | 3 | 1,2 |
4290 | 3 | 1,5 |
5882 | 2 | 0,8 |
5885 | 2 | 1,6 |
5886 | 2 | 1,8 |
5887 | 2 | 1,85 |
5980 | 2 | 2,5 |
5992 | 2 | 27,42 |
5995 | 2 | 3,2 |
5996 | 2 | 3,8 |

Words containing %:

Rank in Wordlist | Frequency | Word |
---|---|---|
9401 | 1 | 10% |
9627 | 1 | 18% |
9661 | 1 | 19% |
9687 | 1 | 2% |
9695 | 1 | 20% |
9745 | 1 | 25,7% |
9756 | 1 | 27% |
9769 | 1 | 3% |
9839 | 1 | 40% |
9845 | 1 | 41% |

Words containing ':

Rank in Wordlist | Frequency | Word |
---|---|---|
4934 | 3 | d'Hippone |
6946 | 2 | Tayitchi'out |
7236 | 2 | d'Ivoire |
7237 | 2 | d'Ávila |
7823 | 2 | l'abbaye |
10143 | 1 | Abu-'Abdollâh |
10931 | 1 | Cɩnɛ'lɛ |
12375 | 1 | L'Ami |
12376 | 1 | L'Argent |
12377 | 1 | L'Impératrice |

Words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
5888 | 2 | 1/3 |
8851 | 2 | µg/L |
9399 | 1 | 1/5 |
9400 | 1 | 1/8 |
9633 | 1 | 1808/1809 |
9703 | 1 | 2010-024/PR |
10641 | 1 | Bq/kg |
18901 | 1 | m3/s |
23370 | 1 | wakɩ/nɔɔyʋ |

In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words, while others will not. If words containing a disallowed character occur with more than very low frequency, this may indicate a problem in preprocessing.
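
The frequency check described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `flag_suspicious`, the `max_freq` cut-off, and the choice of disallowed characters are hypothetical and not part of the report's tooling. The sample entries are taken from the tables in this section.

```python
from collections import defaultdict

def flag_suspicious(word_freqs, forbidden, max_freq=1):
    """Group words by the disallowed character they contain.

    word_freqs: iterable of (word, frequency) pairs.
    forbidden:  characters assumed not to be allowed inside words.
    max_freq:   hypothetical cut-off; only words above it are reported.
    """
    hits = defaultdict(list)
    for word, freq in word_freqs:
        if freq <= max_freq:
            continue  # very rare words are treated as harmless noise
        for ch in forbidden:
            if ch in word:
                hits[ch].append((word, freq))
    return dict(hits)

# A few (word, frequency) pairs copied from the tables in this section.
sample = [("1,2", 3), ("10%", 1), ("d'Hippone", 3),
          ("1/3", 2), ("µg/L", 2), ("L'Ami", 1)]

# Assumption for this example only: % and / are disallowed inside words.
report = flag_suspicious(sample, forbidden="%/", max_freq=1)
print(report)
```

Which characters count as disallowed, and what "very low frequency" means, depend on the language and the corpus size, so both are left as parameters.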
Words containing %:
select w_id-100, freq, word from words where w_id>100 and word like "%\%%" limit 10;
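
The same query can be repeated for every character in the list. The sketch below is a hedged illustration rather than the report's actual tooling. It uses Python's `sqlite3` module with an in-memory toy table instead of the original database, and it assumes a `words(w_id, word, freq)` schema as implied by the query. The helper name `words_containing` is hypothetical. SQLite has no default escape character for `LIKE`, so an explicit `ESCAPE` clause is added, along with an `ORDER BY` to make the result deterministic.

```python
import sqlite3

# Characters inspected in this subsection.
SPECIAL_CHARS = ',()%&$"\'+*=/_'

def words_containing(conn, ch, limit=10):
    """Return (rank, freq, word) rows for words containing the character ch."""
    # Escape the LIKE wildcards (% and _) and the escape character itself.
    escaped = ch.replace("\\", "\\\\").replace("%", "\\%").replace("_", "\\_")
    sql = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(sql, (f"%{escaped}%", limit)).fetchall()

# Toy data: w_id values are chosen so that w_id - 100 matches ranks from the tables.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)")
conn.executemany(
    "INSERT INTO words (w_id, word, freq) VALUES (?, ?, ?)",
    [(50, "%", 100), (4389, "1,2", 3), (5988, "1/3", 2),
     (9501, "10%", 1), (9727, "18%", 1)],
)

# One result list per special character.
report = {ch: words_containing(conn, ch) for ch in SPECIAL_CHARS}
print(report["%"])
```

Escaping matters for `%` and `_`: without it, the pattern for `_` would match every non-empty word instead of only words containing a literal underscore.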